NEW ESTIMATES AND TESTS OF INDEPENDENCE IN SOME 

COPULA MODELS 
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SALIM BOUZEBDA* & AMOR KEZIOU** 

Abstract. We introduce new estimates and tests of independence in copula models with 
unknown margins using (^-divergences and the duality technique. The asymptotic laws of 
the estimates and the test statistics are established both when the parameter is an interior 
point or not. 

Key words: Dependence function ; Multivariate rank statistics; Semiparametric infer- 
ence; Boundary value parameter. 
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1. Introduction and motivations 

Parametric models for copulas with unknown margins have been intensively investigated 
during the last decades. Copulas have become popular in applied statistics, because of 
the fact that they constitute a flexible and robust way to model dependence between 
the margins of random vectors. In this framework, semiparametric inference methods, 
base d on pseudo-lik e lihood , have been appli e d to c opulas by a numb er of authors (see, 



e.g. 



Shih and Louisl (119951 ) , IWang and Ding! (120001 ) , iTsukaharal (120051 ) and the references 



therein). Throughout the available literature, investigations on the asymptotic properties 
of parametric estimators, as well as the relevant test statistics, have privileged the case 
where the parameter is an interior point of the admissible domain. However, for most 
parametric copula models of interest, the boundaries of the admissible parameter spaces 
include some important parameter valu es, typicall y among which, that corresponding to 
the independence of margins. We find in D Jl997T l many examples of parametric copulas, 
for which marginal independence is verified for some specific values of the parameter 9, on 
the boundary <90 of the admissible parameter set C R d , d > 1. This paper concentrates 
on this specific problem. We aim, namely, to investigate parametric inference procedures, 
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in the case where the parameter belongs to the boundary of the admissible domain. 
In particular, it will become clear, that the usual limit laws both for parametric copula 
estimators and test statistics become invalid under these limiting cases, and, in particular, 
under marginal independence. Motivated by this observation, we will introduce a new 
semiparametric inference procedure based on <p- divergences and the duality technique. 
We will show that the proposed estimators remain asymptotically normal, even under 
the marginal independence assumption for appropriate choice of the divergence. This 
will allow us to introduce test statistics of independence, whose study will be made, both 
under the null and the alternative hypotheses. Let F(xi,x 2 ) := P{X\ < X\,X 2 < x 2 ) be 
a 2-dimensional distribution function, and Fi{xj) := P (Xj < Xj), i = 1,2 the marginal 
distribution of F. It is well known since the work of ISklarl (119591 ) that there exists a 
distribution function C on [0, l] 2 with uniform marginals such that 



C( Ul ,u 2 ) :^P{F 1 {X 1 )<u 1 ,F 2 (X 2 )<u 2 } 



See a 



(1991) 



so lDeheuveld (11979 bl ) . iDeheuvelsl (jl980l . ll98ll ). lMoore and Spruilll (119751 ) and lSchweizer 
Formally, copulas can be defined as follows. 



Definition 1.1. An p- dimensional copula is a function C : [0, l] p — > [0, 1] with the fol- 
lowing properties 

(1) C is grounded, i.e., for every u = (ui, . . . , Ud), C(u) = if at least one coordinate 
Ui = 0, i = l,...,p; 

(2) C , is p-increasing, i.e., for every u e [0, l] p and v e [0 , V \ p su c h that u < v, the 
C-volume Vc[u, v] of the box [u, v] is non negative (see \Nelsen\ {(19991) ): 

(3) C(l, . . ., 1, m, 1, . . ., 1) = Ui for all m G [0, If, m = 0, i = l,...,p. 



Many useful multivariate models for dependence between X\ and X 2 turn out to be 
generated by parametric families of copulas of the for m {Cp\ 9 € 9|, typically indexe d 



Kimeldorf and Sampson! (Il975al ). 



Jod ( 



19931 ) among others). In the 



by a vector valued parameter 6 6 8 C R d (se e, e.g., 
Kimeldorf and Sampson! (!l975bl ). iNelsenl (jl999l ). and 
sequel, we assume that Cg(-,-) admits a density c#(-, •) with respect to the Lebesgue 
measure A on R 2 . The nonparametric approach to copula estimation has been initiated 
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by iDeheuvelsl f 



addition, 
(see, also 



1979 bj), who intro duced and investigated the empirical copula process. In 



Deheuvelsl (11 980. 



Fermanian et al. 



981) described the limiting behavior of this empirical process 



(120041 ) and the references therein). In this paper, we consider 



the estimation and test problems for semiparametric copula models with unknown general 
margins. Let (Xik,X 2 k) k = 1, . . . , n be a bivariate sample with distribution function 
Fe T (-, ■) = Ce T (Fi(-), F 2 (-)) where 6t G is used to denote the true unknown value of the 
parameter. In order to estimate 8t, some semiparametric estimation procedures, based 
on the maximization, on the par ameter space 0, of properly c h osen p seudo-likelihood 



Qakes ( 



1994), 



Shi h and Louis! (119951 ). 



Liang and Sell 



criteri o n, have been proposed by 
(119961 ) . IWang and Dingl (120001 ) and iTsukaharal (120051 ) among others. In each of these 
papers, some asymptotic normality properties are established for ^/n[0 — 9t), where 
6 = 6 n denotes a properly chosen estimator of 6t- This is achieved, provided that 9t lies 
in the interior, denoted by 0, of the parameter space C R d . On the other hand, the case 
where 6? € <90 := — is a boundary value of 0, has not been studied in a systematical 
way until present. Moreover, it turns out that, for the above-mentioned estimators, the 
asymptotic normality of ^/n(6 — 9t), may fail to hold f or 9r € (90. In the framework of 
full parametric models with i.i.d. data, ISelf and Liang) (119871 ) summarized all the earlier 
work and provided the exact limiting distributions which are complicated by the number of 
unknown parameters, how many of them are on the boundary and the correlation between 
the components of the estimator proposed. Our approach is novel in this setting and it will 
become clear later on from our results, that the asymptotic normality of the estimate based 
on 0-divergences holds, even under the independence assumption, when, either, 8q is an 
interior, or a boundary point of 0, independently of the dimension of the parameter space. 
The proposed test statistics of independence using ^-divergences are also studied, under 
the null hypothesis Tio of independence, as well as under the alternative hypothesis. The 
asymptotic distributions of the test statistics under the alternative hypothesis are used to 
derive an approximation to the power functions. An application of the forthcoming results 
will allow us to evaluate the sample size necessary to guarantee a pre-assigned power level, 
with respect to a speci fied alterna t ive. T o establish our results, we use similar arguments 



as those developed by 



Tsukaharal (120051 ) in connection with the instrumental statements 
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on rank statistics established by iRuymgaart et al\ (119721 ) and iRiischendorfl (119761 ) among 



others, combined with a new technique (based on the law of iterated logarithm given 
in Lemma 16.11 below) to show both existence and consistency of our estimates and test 
statistics. All mathematical developments are relegated to the appendix. 



2. A NEW INFERENCE PROCEDURE 

Recall that the (^-divergences between a bounded signed measure Q, and a probability P 
on V, when Q -C P is absolutely continuous with respect to P, is defined by 



fdQ 



D ^ P ^-=JJ{dP) ^ 

where is a proper closed convex function from ] — oo, oof to [0, oof with 0(1) = and 
such that the domain dom0 := {x G R : 4>(x) < oo} is an interval with end points 
< 1 < b<f>. The Kullback-Leibler, modified Kullback-Leibler, x 2 , modified x 2 ; Hellinger 
and L l divergences are examples of 0-divergences; they are obtained respectively for 
(fi(x) = a; log a; — x + 1, 4>(x) = — logx + x — 1, 4>(x) = \{x — V) 2 , <j>(x) = , 
<f>(x) = 2(y / x — l) 2 and (f)(x) = \x — 1|. We refer to lLiese and Vajdal (119871 1 for a systematic 
theory of divergences. In the sequel, for all 9, we denote by D </,((), 9t) the 0-divergences 
between Cg and Cq t , i.e., 

= J $ (l^r~) dCg T (u 1 ,u 2 ) = [ (p ( Ce 



D^9,9 7 



dCo 



j d T J J i \ c e T 

Denote C n the empirical copula associated to the data, i.e., 



dCe T {ui,u 2 ) 



(2.1) 



1 n 

C n (ui,U 2 ) := -^2^{F ln (X lk )<u 1 }i{F 2n (X 2k )<u 2 }, (UI,U 2 ) e I, 



(2.2) 



k=l 



and Fj n {t) := - Y^k=i ^-}-oc,t]{Xjk), j = 1,2. In order to estimate the divergences 
D^(9, 9t) for a given 9 e in particular for 9 = 9q, and t he parameter 9t, we will m a 



ke use 



of the dual repre sentation of (^- divergences obtained by iBroniatowski and Kezioul (120061 ) 



Theorem 4.4 and 



Kezioul (120031 ) Theorem 2.3. By this, we readily obtain that D^Oq, Qt) 



can be rewritten into 



D^9,,9 T ) := sup ( // dC 6o - [ 4>*(f) dC dT 
feF Ui J i 



(2.3) 
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where 0* is used to denote the convex conjugate of 0, namely, the function defined by 

0* : t G R i— > 0*(t) := sup {tx — 0(x)} , 

and T is an arbitrary class of measurable functions fulfilling the following two condi- 
tions: V/ G J 7 , J |/| dCe is finite and (p'(dCe /dCe T ) G T . 

Furthermore, the sup in the above display is unique and is achieved at / = <p'(dCe /dCe T )- 
Note that the plug-in estimate 

'dC 6 



dC r , 



dC n (u 1 ,u 2 ) 



of D${9, 9t) is not well defined since Ce is not absolutely continuous with respect to C n ; 
the use of the dual representation, as we will show, avoids this problem. By the above 
statement, taking the class of functions 



f={iiG/Hf(l/c e ); #G9} 



we obtain the formula 



sup 

see 



du\dui 



ce 



dCe T (u 1 ,u 2 ) \ , (2.4) 



whenever Jj \<j)' (l/cg)\ du\du 2 is finite for all 9 G 0. Furthermore, the sup is unique 
and reached at 9 = 9t- Hence, the divergence D^,(9q,9t) and the parameter 9t can be 
estimated respectively by 



sup 

eee 



and 



arg sup 

eee 



du\du 2 — 



du\du 2 — 



1 

— < 

ce 



dC n (u 1 ,u 2 ) 



dC n (ui,u 2 ) 



(2.5) 



(2.6) 



in which Cg T is replaced by C n . Note that this class o f estim ates contains the maximum 
pseudo-likelihood (MPL) estimator proposed by lOakesi (119941 ); it is obtained for the KL m - 
divergence taking <p{x) = — log(x) + x — 1. Under some regularity conditions, we can prove 
that these estimates are consistent and asymptotically normal in the same way as the MPL 
estimate when the parameter 9t is an interior point of the parameter space G. The interest 
of divergence remains in the fact that a properly choice of the dive rgence may ameliorate 



the MPL one in terms of efficiency-robustness. The results in 



Bouzebda and Keziou 
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( 120081 ) show that, for = [80,00), and when the true value 8t of the parameter is equal 
to #0 (corresponding to the independence assumption), the classical asymptotic normality 
property of the MPL estimate is no more satisfied. To overcome this difficulty, in what 
follows, we enlarge the parameter space into a wider space e D 0. This is tailored to 
let #0 become an interior point of e . More precisely, set 



e := I 8 G R d such that J \(p'(l/cg(ui, u 2))\ du\du 2 < 00 
So, applying (12. 3p . with the class of functions 

T := {(ui,u 2 ) h-> <p'(l/ce{u 1 ,u 2 )); 8 e e }, 

we obtain 



(2.7) 



Dd>(8 ,8 T ) = sup I / (f)' ( — J du 1 du 2 - i 
0ee e KJi \ c eJ J 1 



1 

— < 



dC eT ( Ul ,u 2 )\ . (2.8 



Furthermore, the sup in this display is unique and reached in 8 = 8t- Hence, we propose 
to estimate D d 



'dyvour) by 

D(f,(8 , 8 T ) 
and to estimate the parameter 8t by 



sup / m(8,u 1 ,u 2 ) dC n (ui,u 2 ), 
6>ee e 



8,, 



arg sup 

0ee e 



m(8,ui,u 2 ) dC n (ui,u 2 ) 



(2.9) 



(2.10) 



where 
m(8,ui,u 2 ) 



ce(ui,u 2 ) 



du\du 2 —\ (f)' 



ce(ui,u 2 )J c e {ui,u 2 ) 



ce{ui,u 2 ) 



In the sequel we denote by -^m(8, u\,u 2 ) the ci-dimensional vector with entries -^-m(8, u\, u 2 ) 
and -^m(8,Ui,u 2 ) the d x c?-matrix with entries 9 q 2 qb m{8, u\, u 2 ). 

Remark 2.1. Divergences measures have been intensively used in estimation and test in 
the framework of the discret parametric models with independent identically distributed 
data; the e stimates of the d i vergen c es and the p arameter are obtained by the plug-in 



method; see 



Liese and Vaidd ((19871) . \Pardd \200w and the references therein. For con- 



tinuous param e tric models the p l ug-in p rocedure does not lead to well def ined estimates; 



Keziou (2003) . \Liese and Vajdd \200w . Wroniatowski and Keziou ((2008i) introduce new 
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estimates and tests, using the dual representation of divergences, extending the maximum 
likelihood procedure. 



3. The asymptotic behavior of the estimates 



In this section, we provide both weak and strong consistency of the estimates (12.91) and 
( I2.10j) . We also state their asymptotic normality and evaluate their limiting variance. 
Statistics of the form 



i/j(u 1 ,u 2 ) dC n (ui,u 2 ) 



belong to the general class of multivariate rank statistics. Their asymptotic properties 
have been investigated a t length by a num ber of authors, among whom we may cite 



Ruymgaart et all (119721 ). 



Ruymgaart 



(119741 ) and iRuschendorJ (119761 ). In particular, the 
previous authors have provided regularity conditions, imposed on which imply 

the asymptot i c nor mality of ty n . The corresponding arguments have been modified by 



Genest et al. 



(|l995T). as to estab l ish a 



consider (see, e.g., 



Genest et al. 



most sure convergence of the estimators that they 



(119951 ) Proposition A.l). In the same spirit, the limiting 



behavior, as n tends to the infinity, of the estimators and test statistics which we will 



introduce later on, will make an instrumental use of the gen eral theory o 



Genest et al. 



multivariate 



(119951 ). The 



rank statistics, and rely, in particular, on Proposition A.l in 
existence and consistency of our estimators will be established through an application 
of the law of the iterated logarithm for empirical copula processes, i n combinat i on wit h 
general arguments from multivariate rank statistics theory (we refer to Deheuvelsl ( 1979a ). 

(120041 ) and references therein). We will use the following notations 

1 



Fermanian et al 



K 1 (9,u 1 ,u 2 ) := <p' 



and 



K, 



Ui,U 2 



1 



ce(ui,u 2 



1 



1 



c e (ui,u 2 )J c e (ui,u 2 ) \ce(ui,u 2 ) 

Definition 3.1. (i) Let Q be the set of continuous functions q on [0, 1] which are posi- 
tive on (0, 1), symmetric about 1/2, increasing on [0, 1/2] and satisfy Jq {q(t)}~ 2 dt < 
oo. 
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(ii) A function r : (0, 1) — > (0, oo) is called u-shaped if it is symmetric about 1/2 and 
increasing on (0, 1/2]. 

(iii) For < /3 < 1 and and u-shaped function r, we define 



r(pt) 



if 0<t< 1/2; 



r{l-P(l-t)} if 1/2 < t < 1/2. 



If for (5 > in a neighborhood ofO, there exists a constant Mp, such that rg < Mpr 
on (0, 1), then r is called a reproducing u-shaped function. We denote by 1Z the 
set of reproducing u-shaped functions. 



We make use of the following conditions. 

(C.l) There exists a neighborhood N(9t) C O e of 9? such that the first and the second 
partial derivatives with respect to 6 of Ki(9,ui,u 2 ) are dominated on N(9t) by 
some A-integrable functions; 

(C.2) There exists a neighborhood N(9 T ) of 9 T , such that for all 9 € N(9 T ), the function 
J| m(9, ui, U2) '■ (0, l) 2 — > R is continuously differentiable and there exist functions 
r-i ElZ,r,i ElZ and q e Q (i, j = 1, 2, « 7^ j and £,£' — 1, . . . , m) with 



(i) 

(ii) 

(iii) 
(iv) 

fv) 



-^m{9,u 1 ,u 2 )\ < ri(ui)r 2 (u 2 ), 
Q^-.m{9,u u u 2 )\ <ri(ui)rj(uj); 
m 2 (9,u 1 ,u 2 )\ < r^n^r^n,); 
-^K 2 (9,U!,u 2 )\ <ri(ui)rj(uj); 
m(9,u 1 ,u 2 )\ < r^Uiy^Uj); 
J^m(0,ni,n 2 )| 2 < ^(n^r^n,), 
: m(0, ni,n 2 )| < ^(n^r^n,); 



1 d^ae^ 1 

( vi ) IA m ^' Ml ' M2 )l - f i(^) r i( M i) 

and / 7 {ri(ni)r 2 (n2)} 2 dC eT (ni,n2) < 00, J T {^(n^r^n^^-^)} dCg T (u 1 , u 2 ) < 00; 
(C.3) The matrix f(d 2 /d 2 9)m(9,ui,u 2 )dCg T (ui,u 2 ) is non singular; 
(C.4) The function (u±, u 2 ) E I 1— > m(9 T , n 1; n 2 ) is of bounded variation on J. 



Theorem 3.1. Assume that conditions (C.l-3) hold. 
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(1) Let .B(#r,n -1 / 3 ) := £ B e , \9 — 0t\ < n^ 1 / 3 } , then as n tends to infinity, with 
probability one, the function 9 t— > J m{6,ui,U2) dC n {u\,U2) attains its maximum 
value at some point 9 n in the interior of B(9T,n^ 1 ^ 3 ), which implies that the 
estimate 9 n is consistent and satisfies 





09 



m(9 n ,u ll u 2 ) dC n (u 1 ,u 2 ) = 0. 



(2) \fn[9 n — 9) converges in distribution to a centered multivariate normal random 
variable with covariance matrix 



= S^MS" 1 , 



with 

and 
M := 

where 



Var 



f d 2 

S := - / -^m(9 T ,u 1 ,u 2 )dCe T {ui,u 2 ) 



^m(9 T , F 1 (X 1 ), F 2 (X 2 )) + W 1 (9 T , X x ) + W 2 (9 T , X 2 ) 
o9 



Wi(9 T ,Xi) := J^t {Fi{ x t )<u l }g§^-m(9 T ,u 1 ,u 2 )c eT (u 1 ,u 2 ) du ± du 2 , i = 1,2. 
The proof of Theorem 13.11 is postponed to section 6. 



(3.1) 



(3.2) 



(3.3) 



4. New tests of independence 

One of our motivation is to build a statistical test of independence, based on ^-divergence. 
In the framework of the parametric copula model, the null hypothesis, i.e., the indepen- 
dence case Cq(ui,u 2 ) = U\U 2 corresponds to Hq '■ 9 T = 9 . We consider the composite 
alternative hypothesis 7i\ : 9t ^ 6q. Since, 9q is a boundary value of the parameter space 
9, we can see that the convergence in distribution of the c orresponding pseudo-likelihoo d 
ratio statistic to a x 2 random variable does not hold; see Bouzebda and KezioJ ( 2008 ). 
We give now a solution to this problem. We propose the following statistics 



2n - — - 



(4.1) 
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We will see that the proposed statistic converges in distribution, under the null hypothesis 
TCo, to a x 2 random variable with d degrees of freedom, which permits to build a test of 
Tio against 7Yi asymptotically of level a. The limit law of T n is given also under the 
alternative hypothesis We will use the following additional conditions 

(C.5) We have 

d 2 

hm m(0, Mi, u 2 ) = 0, 

9^e d6 e dui 

and there exist Mi > and 5\ > such that, for all 9 in some neighborhood of 
6*o, one has, for i = 1, 2, 



1 d9idu„ 

where r(u) := u(l — u) for u G (0, 1). 

Remark 4.1. When 6t = , under the conditions (C.l) and (C.5) we can see that S 
and M can be written as 



M 



dCe T {u 1 ,u 2 ) 



Theorem 4.1. (1) Assume that conditions (C.l-5) hold. If 6? = Bo, then the statistic 
T n converges in distribution to a \ 2 variable with d degrees of freedom. 
(2) Assume that conditions ( C. 1-4 ) hold. If9r 7^ do, then y/n (d^Oq, 9t) — D</>(9o, 0t) 
converges in distribution to a centered normal variable with variance 

0-K60, 6 T ) := Var [m(6 T , F 1 (X 1 ), F 2 (X 2 )) + Y 1 {6 T , X x ) + Y 2 (6 T , X 2 )] , (4.2) 

where 

Yi(8 T ,Xi) := J t{ Fi ^ i )<u i }^rn(6 T ,ui,u 2 )ce T (ui,u 2 ) du x du 2 , 1 = 1,2. 
The proof of Theorem 14.11 is postponed to section 6. 



Remark 4.2. An application of Theorem 4-1, leads to reject the null hypothesis Tio : 9t 



6*0, whenever the value of the statistic T n exceeds qi- a , namely, the (1 — a)-quantile of the 
X 2 law with d degrees of freedom. The corresponding test is then, asymptotically of level 
a, when n — > oo. The critical region is, accordingly, given by 

CR := {T n > . 
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The fact that this test is consistent follows from Theorem \4-l\ Further, this theorem can 
be used to give a n approximation to the p o wer fu nction ^£6h P(9t) '■= Pe T {CR} in 
a similar way to \Keziou and Leoni-Aubin (2008) . We so obtain that 



i - $ 



n 



gl-a 

2n 



?0, VT, 



(4.3) 



where $ denotes, as usual, the cumulative distribution function of a N(0, 1) standard 
normal random variable. A useful consequence of lji4-3\ ) is the possibility of computing 
an approximate value of the sample size ensuring a specified power (3{6t), with respect to 
some pre-assigned alternative 6t ^ do- Let uq be the positive root of the equation 



which can be rewritten into 



n 



qi-a 



6 T ) V In 



n 



(a + b) - y/a(a + 26) 



2D,(0 O , T ? 

where a := ct^q, 6*t) ( < & _1 (1 — P)) 2 and b := qi- a D^(6o, 9 T ). The sought-after approxi- 
mate value of the sample size is then given 



n* := [n \ + 1, 



where [u\ denote the integer part of u. 



Remark 4.3. From Theorem \3.1\ and\4-l\ it is clear that an asymptotic 1 — a confidence 



interval or region, 1Z n about 6 ca n be ea sily constructed using the Intersection method as 



described in 



Fena and McCullocli hi 992 ). 



Remark 4.4. The above regularity conditions are satisfie d by a l arge number of parametric 
families of bivariate copulas; see for instance I Tsukahara (200a) . 



Remark 4.5. The parameters A3. 2\) and A3.3\) may be consistently estimated respectively 
by the sample mean of 

d 2 



-^m(8 n , F ln (X hk ), F 2n (X 2 , k )), k=l,...,n, 



(4.4) 
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and the sample variance of 

> (e n , F ln (X ljk ), F 2n (X 2 , k )) + Wx(9 n ,X hk ) + W 2 (9 n ,X 2 , k ), k = l,...,n, (4.5) 



as was done in 



Genest et al. 



(11993) . The asymptotic variance (4-<ty can be consistently 



estimated in the same way. 



5. Concluding remarks 



We have introduced a new estimation and test procedure in parametric copula models 
with unknown margins. The methods is based on divergences between copulas and the 
duality technique. It generalizes the maximum pseudo-likelihood one, and applies both 
when the parameter is an interior or a boundary value, in particular for testing the null 
hypothesis of independence. It will be interesting to investigate the problem of the choice 
of the divergence which leads to an "optimal" (in some sense) estimate or test in terms 
of efficiency and robustness. 

6. Appendix 

First we give a technical Lemma which we will use to prove our results. 

Lemma 6.1. Let Fg T have a continuous margins and let Cg T have continuous partial 
derivatives. Assume that j is a continuous function, with bounded variation. Then 

j( Ul , u 2 ) d (C n (ui, u 2 ) - C( Ul , u 2 )) = O (^(loglogn) 1 / 2 ) (a.s.). (6.1) 



Proof of Lemma 16.11 Recall that the modified em pirical copula C n , is slightly different 
from the empirical copula C n (ui,u 2 ), introduced by iDeheuveld (|1979al ). and defined by 



<C„K,u 2 ) = F n ^F-l" 1 (ui) , F 2 * (w 2 )) for ( Mi ,m 2 ) G (0,1) 



(6.2) 



where F ln (•) and F 2j ^(-) denote the empirical quantile functions, associated with Fi n (xi) 
F n (xx,oa) and F 2n (x 2 ) = F n (oo,x 2 ), respectively, and defined by 



F-\t) := inf {x e R | F jn (x) > t}, j = 1, 2. 



(6.3) 
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Here, F n (-, •) denotes the joint empirical distribution function, associated with the sample 
{(X lk , X 2k ); k = 1, . . . , n}, defined by 



1 n 



{X 2k <x 2 }, -oo < X]_,X 2 < oo. 



(6.4) 



k=l 



We know that C n and C n coincide on the grid {(i/n,j/n) , 1 < i < j • < n} . The subtle 
difference lies in the fact that C„ is left-continuous with right-hand limits, whereas C n on 
the other hand is right continuous with left-hand limits. The difference between <C n and 
C n , however, is small 



sup |C„(u) — C n (u)| < max 



c 



I J 



n I ; 

n n 



i-l j-l 



n n 



< 



n 



(6.5) 



Using integration by parts, as in 



Fermanian et al. 



(120041 ) . we see that 



j(u 1 ,u 2 ) d(C n - C)(u 1 ,u 2 ) = / ^/n(C n - C)(ui,u 2 ) dj(ui,u 2 ) 



\/n(C n - C)(ui, 1) dj(ui,u 2 ) - / \/n(C n - C)(l,u 2 ) dj(ui,u 2 ) 



y/n{C n {u u 1) - ui) dj{u u I) 



Hence, 



[0,1] 



n \ j(ux,u 2 ) d(C n - C)(ux,u 2 ) 



\/n(C n (l,u 2 ) -u 2 ) dj(l,u 2 ). 



[0,1] 



<5v^sup|(L7 n -C)(u)| / d\j(u)\. 
ue/ J i 



From this and (16. 5j) . applying Theorem 3.1 in iDeheuveld (Il979al ). we obtain the following 
result 

j( Ul ,u 2 ) d(C n -C)(u 1 ,u 2 ) = 0(n- l ^(\oglogn) 1 / 2 ) (a.s.). 



Proof of Theorem 13.11 (1) Under the Assumption (C.l) and (C.2.iii), a simple calcula- 
tion gives 



d_ 

j 00 



m(9,ui,u 2 )dCg T (ui,u 2 ) = 0, 



and 



d 2 



m(9, Mi, u 2 )dCe T (ui, u 2 ) 



1 \ c 9t c i 



T^e T 



dX 



°8t J c i 



(6.6) 



(6.7) 
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We see that the matrix S is symmetric and positive using the fact that the second deriv- 
ative (j) (•) is nonnegative by the assumption that the function </>(•) is convexe. Hence, S 
is positive definite by (C.3). Introduce the statistic <& n (0r) defined by 

a .. .,. . 



^n{d T ) := J -7^m(9 : ui,u 2 ) dC n (u 1 ,u 2 ) 



and combine (16.61) with Theorem 2.1 in lRuymgaart et all (119721 ) to show that, as n — > oo 



V5£* n (0r) A JV(0,M), 
where M is defined in (13.31). Denote 



n\VT 



! de 2 



m(9,ui,u 2 ) dC n (ui,u 2 ), 



(6.9) 



(6.10) 



Gcncst et al. 



we ma ke use of (16. 7p and (C.2.v) in combinaison with Proposition A.l of 
(119951 ). one finds 

T n (9 T ) - -S, (a.s.). (6.11) 

We recall that S is in ( 13. 2ft . Now, for any 9 = 6t + vn~ l l 3 with |i>| < 1, consider a Taylor 
expansion of J m(9,Ui,u 2 ) dC n (ui,u 2 ) in 6 1 around 9t, and use (16. lip , and (C.2.iii) to 
obtain 



n J m(9,ui,u 2 )dC n (ui,u 2 ) —n J m(9 T ,u 1 ,u 2 ) dC n {u u u 2 ) = 
n 2 / 3 v T $ n {9 T ) + 2- 1 n 1 / 3 vSv T + 0(1) (a.s.) (6.12) 

uniformly in v with |u| < 1. On the other hand, since J -^m(9T,ui,u 2 ) 2 dCg T (ui,u 2 ) < 
oo, and ^-m(9, -,-) is of bounded variation by assumption (CA)(£ = l,...,d), using 
Lemma 16.11 we can show that 

I -!Lrn{9 T ,Ui,u 2 ) dC n (u u u 2 ) = O (n^ 2 (log log n) 1 ' 2 ) (a.s.). (6.13) 
J i ode 

Therefore, using (16. 1 2[) and (I6.13p . we obtain for any 9 = 9j> + vn^ 1 ^ 3 with \v\ = 1: 

n J m(9, ut,u 2 ) dC n (ui,u 2 ) -n J m(9 T ,u 1 ,u 2 ) dC n (ux,u 2 ) 

< 0(n 1/6 (loglogn) 1/2 ) - 2-\9n 1/3 + 0(1) (a.s.), (6.14) 

where $ is the smallest eigenvalue of the matrix S. Observe that $ is positive since S is 
symmetric, positive and non singular by assumption (C.3) Using (I6.14p and the fact that 
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the function 9 \— > fjin^Or, Ui, u 2 ) dC n {ui,U2) is continuous, we conclude that as n — > oo, 
with probability one, 9 i— > J 7 itl(6t, U\, u 2 ) dC n (ui, u 2 ) reaches its maximum value at some 
point 9 n fulfills Jj ^m{6 n ,ui,u 2 ) dC n (ui,U2) = and \6 n — 9t\ = 0(n -1 / 3 ). 



(2) Using the first part of Theorem I3.1[ by a Taylor expansion of 

d 



,^m(9 n ,ui,u 2 ) dC n (u 1 ,u 2 ) 



in 6 n around 9t, we obtain 



/d ^ f d 2 

— m(0 T ,u 1 ,u 2 ) dC n (ui,u 2 ) + (6 n ~ 0t) T j qq^(°t, Ui,u 2 ) dC n (u 1 ,u 2 ) + o(n~ 1/2 ) 

Hence, 

V^(e n -6 T ) = - [T^OtT 1 V^<S>u(6t) + o P (l) (6.15) 
Using (16.91) and (16.111) . by Slutsky theorem, we conclude then 

V^(fi n -9 T ) ->JV(0,E^), (6.16) 

where we recall that is defined in (13.11) . ■ 
Proof of Theorem 14.11 

(1) Assume that 9t = . Hence, from (I6.15p . using (16. 7p . we obtain 

(On - 6 T ) = -S- 1 ^^^) + o P (l). (6.17) 

On the other hand, expanding in Taylor series 

2n - — - — 2n f ^ 
-^jYjD^Oo^n) = -^jYj j m(9 n ,ui,u 2 ) dC n (ui,u 2 ) 

in 8 n around 6t, in combination with the fact that j I m{9T-,ui J u 2 ) dC n (ui,u 2 ) = 0, we 
get 

- -D^(e ,e n ) = -—®4e T )(6 n -d T )-—-(e n -d T ) T r n (e T )(6 n -d T )+o P (i). (e.is) 



0"(i) vv u ' nj ret) ^"(i; 

Using (16. 7p . (I6.17P and the fact that S = <fi"(l)Ie T {h T denotes the Fisher information 
matrix) when 9t = 0q to obtain 

2n - — 1 ^ 

_D,(0OA) = ^V^n(W^V^<W). (6.19) 
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Finally, use the convergence in ( 16. 9ft and the fact that M = <f>' (1)Iq t when 6t = 0o, to 
conclude that [■prfc]D<j>(9o, 9t) converge in distribution to a x 2 variable with d degrees of 
freedom when 9t = do- B 

(2) Assume that 9t ^ 0o, using Taylor expansion again of 

D^,{0 t ,0q) = J m{O n ,ui,u 2 ) dC n {ui,u 2 ) 

in n around Ot, combined with the fact that J z {ui,u 2 ) = 0, we 

obtain from part (2) of Theorem 13.11 

J m{0 n ,ui,u 2 ) dC n (u 1 ,u 2 ) = J m(0 T ,Ui,u 2 ) dC n (ui, u 2 ) + o P (n~ 1/2 ). 



Hence 



V^{d^Oo,Ot)-D^Oo,0j 



n[ I m(6 T ,ui,u 2 ) dC n (u 1 ,u 2 ) - J m{6 T ,u l ,u 2 ) dCg T (ui,u 2 ) ) +o P (l), 



which under assumption (C.2.ii) and (C.2.vi) by Theorem 2.1 in lRuymgaart et al\ ( 11972h 
once more, converges to a centred normal variable with variance given in (14.21) . ■ 
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References 

Bouzebda, S. and Keziou, A. (2008). A test of independence in some copula models. 
Mathematical Methods of Statistics, 17(2), 1-15. 

Broniatowski, M. and Keziou, A. (2006). Minimization of (^-divergences on sets of signed 
measures. Studia Sci. Math. Hungar., 43(4), 403-442. 

Broniatowski, M. and Keziou, A. (2008). Parametric estimation and tests through diver- 
gences and duality technique. To appear in J. Multivariate Anal. 

Deheuvels, P. (1979a). La fonction de dependance empirique et ses proprietes. Un test 
non parametrique d'independance. Acad. Roy. Belg. Bull. CI. Sci. (5), 65(6), 274-292. 



NEW ESTIMATES AND TESTS OF INDEPENDENCE IN SOME COPULA MODELS 17 

Deheuvels, P. (1979b). Proprieties d'existence et proprietes topologiques des fonctions de 

dependance avec applications a la convergence des types pour des lois multivariees. C. 

R. Acad. Sci. Pans Ser. A-B, 288(2), A145-A148. 
Deheuvels, P. (1980). Nonparametric test of independence. In Nonparametric asymptotic 

statistics (Proc. Conf., Rouen, 1979) (French), volume 821 of Lecture Notes in Math., 

pages 95-107. Springer, Berlin. 
Deheuvels, P. (1981). A Kolmogorov-Smirnov type test for independence and multivariate 

samples. Rev. Roumaine Math. Pures AppL, 26(2), 213-226. 
Feng, Z. and McCulloch, C. E. (1992). Statistical inference using maximum likelihood esti- 
mation and the generalized likelihood ratio when the true parameter is on the boundary 

of the parameter space. Statist. Probab. Lett., 13(4), 325-332. 
Fermanian, J.-D., Radulovic, D., and Wegkamp, M. (2004). Weak convergence of empirical 

copula processes. Bernoulli, 10(5), 847-860. 
Genest, C, Ghoudi, K., and Rivest, L.-P. (1995). A semiparametric estimation procedure 

of dependence parameters in multivariate families of distributions. Biometrika, 82(3), 

543-552. 

Joe, H. (1993). Parametric families of multivariate distributions with given margins. J. 
Multivariate Anal, 46(2), 262-282. 

Joe, H. (1997). Multivariate models and dependence concepts, volume 73 of Monographs 
on Statistics and Applied Probability. Chapman & Hall, London. 

Keziou, A. (2003). Dual representation of 0-divergences and applications. C. R. Math. 
Acad. Set. Pans, 336(10), 857-862. 

Keziou, A. and Leoni-Aubin, S. (2008). On empirical likelihood for semiparametric two- 
sample density ratio models. Journal of Statistical Planning and Inference, 138(4), 
915-928. 

Kimeldorf, G. and Sampson, A. (1975a). One-parameter families of bivariate distributions 
with fixed marginals. Comm. Statist., 4, 293-301. 

Kimeldorf, G. and Sampson, A. (1975b). Uniform representations of bivariate distribu- 
tions. Comm. Statist., 4(7), 617-627. 



18 SALIM BOUZEBDA* & AMOR KEZIOU** 

Liang, K.-Y. and Self, S. G. (1996). On the asymptotic behaviour of the pseudolikelihood 
ratio test statistic. J. Roy. Statist. Soc. Ser. B, 58(4), 785-796. 

Liese, F. and Vajda, I. (1987). Convex statistical distances, volume 95. BSB B. G. Teubner 
Verlagsgesellschaft, Leipzig. 

Liese, F. and Vajda, I. (2006). On divergences and informations in statistics and infor- 
mation theory. IEEE Trans. Inform. Theory, 52(10), 4394-4412. 

Moore, D. S. and Spruill, M. C. (1975). Unified large-sample theory of general chi-squared 
statistics for tests of fit. Ann. Statist., 3, 599-616. 

Nelsen, R. B. (1999). An introduction to copulas, volume 139 of Lecture Notes in Statistics. 
Springer- Verlag, New York. 

Oakes, D. (1994). Multivariate survival distributions. J. Nonparametr. Statist., 3(3-4), 
343-354. 

Pardo, L. (2006). Statistical inference based on divergence measures, volume 185 of Sta- 
tistics: Textbooks and Monographs. Chapman & Hall/CRC, Boca Raton, FL. 

Ruschendorf, L. (1976). Asymptotic distributions of multivariate rank order statistics. 
Ann. Statist, 4(5), 912-923. 

Ruymgaart, F. H. (1974). Asymptotic normality of nonparametric tests for independence. 
Ann. Statist, 2, 892-910. 

Ruymgaart, F. H., Shorack, G. R., and van Zwet, W. R. (1972). Asymptotic normality 
of nonparametric tests for independence. Ann. Math. Statist., 43, 1122-1135. 

Schweizer, B. (1991). Thirty years of copulas. In Advances in probability distributions 
with given marginals (Rome, 1990), volume 67 of Math. Appi, pages 13-50. Kluwer 
Acad. Publ., Dordrecht. 

Self, S. G. and Liang, K.-Y. (1987). Asymptotic properties of maximum likelihood estima- 
tors and likelihood ratio tests under nonstandard conditions. J. Amer. Statist. Assoc., 
82(398), 605-610. 

Shih, J. H. and Louis, T. A. (1995). Inferences on the association parameter in copula 
models for bivariate survival data. Biometrics, 51(4), 1384-1399. 

Sklar, M. (1959). Fonctions de repartition a n dimensions et leurs marges. Publ. Inst. 
Statist Univ. Paris, 8, 229-231. 



NEW ESTIMATES AND TESTS OF INDEPENDENCE IN SOME COPULA MODELS 19 

Tsukahara, H. (2005). Semiparametric estimation in copula models. Canad. J. Statist., 
33(3), 357-375. 

Wang, W. and Ding, A. A. (2000). On assessing the association for bivariate current 
status data. Biometrika, 87(4), 879-893. 

*LSTA-Universite Paris 6, 175, rue du Chevaleret, Boite 158, 75013 Paris, bouzebda® 
ccr.jussieu.fr 

**Laboratoire de Mathematiques (UMR 6056) CNRS, Universite de Reims Champagne- 
Ardenne and LSTA-Universite Paris 6. UFR Sciences, Moulin de la Housse, B.P. 1039, 
51687 Reims, France, amor.keziou@univ-reims.fr 



